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AAG12684 126 aa linear PLN 22-JAN-2001 

myb-related protein; 20671-21051 [Arabidopsis thaliana] . 

AAG12684 

AAG12684.1 GI:10092271 

locus AC025814 accession AC02 5814 . 7 

Arabidopsis thaliana (thale cress) 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons; 
rosids; eurosids II; Brassicales; Brassicaceae; Arabidopsis. 

1 (residues 1 to 126) 

Lin,X., Kaul,S., Town, CD., Benito, M. -I . , Creasy, T.H., Haas, B. J., 
Wu,D., Maiti,R., Ronning,C.M. , Koo,H., Fujii,CY., Utterback, T . R . , 
Barnstead, M. E . , Bowman, C . L . , White, 0., Nierman,W.C. and Fraser,C.M. 
Arabidopsis thaliana chromosome 1 BAC F22H5 genomic sequence 
Unpublished 

2 (residues 1 to 126) 
Lin, X. and Kaul, S . 
Direct Submission 

Submitted ( 15-MAR-2000 ) The Institute for Genomic Research, 9712 
Medical Center Dr, Rockville, MD 20850, USA, xlin@tigr.org 

3 (residues 1 to 126) 
Town, CD. and Kaul,S. 
Direct Submission 

Submitted ( 12-SEP-2000 ) The Institute for Genomic Research, 9712 
Medical Center Dr, Rockville, MD 20850, USA, cdtown@tigr.org 

4 (residues 1 to 126) 
Town, CD. and Kaul,S. 
Direct Submission 

Submitted (05-OCT-2000) The Institute for Genomic Research, 9712 
Medical Center Dr, Rockville, MD 20850, USA, cdtown@tigr.org 

5 (residues 1 to 126) 
Town, CD. and Kaul,S. 
Direct Submission 

Submitted ( 12-OCT-2000 ) The Institute for Genomic Research, 9712 
Medical Center Dr, Rockville, MD 20850, USA, cdtown@tigr.org 

6 (residues 1 to 126) 
Town, CD. PhD. 
Direct Submission 
Submitted (22-JAN-2001) 

Address all correspondence to : atStigr . org 

BAC clone F22H5 is from Arabidopsis thaliana chromosome 1 

The orientation of the sequence is from SP6 to T7 end of the BAC 

clone . 

Genes were identified by a combination of several methods: Gene 
prediction programs including Genscan+ (Chris Burge, 
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1), GeneMarkHMM (Mark Borodovsky, 
/GeneMark/ ) , GlimmerA (a variant 



crnl , and 
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GeneSplicer (Mihaela Pertea and Steven Salzberg, contact 
mpertea@tigr.org), searches of the complete seguence against a 
peptide database and the plant EST database at TIGR 
( http : / /www . tig r . org/tdb/tgi . sh t: rnl ) . Annotated genes are named to 
indicate the level of evidence for their annotation. Genes with 
similarity to other proteins are named after the database hits. 
Genes without significant peptide similarity but with EST 
similarity are named as unknown proteins. Genes without protein 
or EST similarity, that are predicted by more than two gene 
prediction programs over most of their length are annotated as 
hypothetical proteins. Genes encoding tRNAs are predicted by 
tRNAscan-SE (Sean Eddy, frc _f<^ o. e tl edu - - -u L/) . 

Simple repeats are identified by repeatmasker (Arian Smit, 

H 1 ,- : / ' f p . 7 -j' o kcr -1 ) . 

Method: conceptual translation. 

Location/Qualifiers 
1 . . 126 

/organism="Arabidopsis thaliana" 

/db_xref ="taxon : 37 0 2 " 

/chromosome=" 1 " 

/clone="F22H5" 

/ecotype=" Columbia" 
Protein 1. .126 

/product="myb-related protein; 20671-21051" 
Region 11. .54 

/ region_name="SANT" 

/note="SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding 
domains; smart00717" 
/db_xref="CDD: 47985 " 
CDS 1..126 

/gene="F22H5.3" 

/coded_by="AC025814 . 7 : 20955. .21335" 
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